# High-resolution human segmentation
Sapiens Seg 0.6b Bfloat16
Sapiens is a family of Vision Transformer models pre-trained on 300 million 1024x1024 resolution human images, focusing on human-centric vision tasks.
Image Segmentation English
S
facebook
24
0
Sapiens Seg 0.3b
Sapiens is a family of Vision Transformer models pre-trained on 300 million 1024×1024 resolution human images, focusing on human-centric vision tasks.
Image Segmentation English
S
facebook
48
2
Sapiens Seg 0.3b Torchscript
Sapiens is a family of vision Transformer models pre-trained on 300 million 1024 x 1024 resolution human images, supporting 1K high-resolution inference, demonstrating exceptional generalization to real-world data even with scarce or entirely synthetic labeled data.
Image Segmentation English
S
facebook
56
0
Sapiens Seg 1b Torchscript
Sapiens is a series of vision transformers pre-trained on 300 million 1024×1024 resolution human images, specifically designed for human-centric vision tasks with exceptional generalization capabilities.
Image Segmentation English
S
facebook
892
1
Featured Recommended AI Models